Automatic Speaker

نویسندگان

Hubert Jin

Francis Kubala

چکیده

This paper presents a fully automatic speaker clustering algorithm , which consists of three components: building a distance matrix based on Gaussian models of the acoustic segments; performing hierarchical clustering on the distance matrix with the prior assumption that consecutive segments should be more likely to come from the same speaker; and selecting the best clustering solution automatically by minimizing the within-cluster dispersion with some penalty against too many clusters. We applied this automatic speaker clustering technique in 1996 Hub4 evaluation, and the results show that it contributed signiicantly to the word error rate (WER) reduction in unsupervised adaptation. From our experiments , the algorithm seldom misclassiies segments from the same speaker into diierent clusters. We used the same clustering procedure for both partitioned evaluation (PE) and unpartitioned evaluation (UE) tests 1]. Experiments also show that this automatic speaker clustering algorithm improves unsupervised adaptation as much as the hand labeled ideal case where the clusters are generated based on true speaker, channel and background condition.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Objective Peak-Detection in Complex Auditory Brainstem Response to /ba/, /da/, /ga/: A Novel Technique

Objectives: The result of auditory brainstem response is used worldwide for detecting hearing impairments or hearing aids. This study aimed to introduce the superiority of mathematical innovation algorithm toward subjective evaluation by an audiologist. The automatic algorithm method is encouraged for detecting the waves of Auditory Brainstem Response (ABR), because it can reduce subjective eva...

متن کامل

Remes Speaker - Based Segmentation and Adaptation in Automatic Speech Recognition

With proper training, automatic speech recognition works quite well when tested in conditions similar to the training conditions, but with a new speaker or a new environment the system performance often degrades. Speaker-based adaptation alters the speech recognition system to better match a specific speaker and thus improves the speech recognition results. In order to use speaker adaptation, t...

متن کامل

A semi-automatic approach for speaker mining of tapped telephone conversations

Speaker mining involves speaker detection in a set of multispeaker files. In previous work on speaker mining, training data is used for constructing target speaker models. In this study, a new speaker mining scenario was considered, where there is no demarcation between training and testing data and prior target speaker models are absent. Given the ENRON database which consists of tapped teleph...

متن کامل

On the Use of Automatic Speaker

In forensic applications of speaker recognition it is necessary to be able to specify a conndence level for a decision that two sets of recordings have been produced by the same speaker (or by diierent speakers). Forensic phoneticians are sometimes incriminated because they nd it impossible to provide 'hard' estimates of the conndence level of an expert opinion. In this paper it is investigated...

متن کامل

A Critical Review on Automatic Speaker Recognition

Automatic Speaker Recognition (ASR) is use to recognizing persons from their voice. Since the voice of every human is not same because their vocal tract shapes, larynx sizes and other parts of a human voice production system. Automatic Speaker recognition is a procedure to automatically recognizing a speaker or who is speaking by the individual information counted in speech signal/waves. Automa...

متن کامل